Optimal Worker Quality and Answer Estimates in Crowd-Powered Filtering and Rating

نویسندگان

  • Akash Das Sarma
  • Aditya G. Parameswaran
  • Jennifer Widom
چکیده

We consider the problem of optimally filtering (or rating) a set of items based on predicates (or scoring) requiring human evaluation. Filtering and rating are ubiquitous problems across crowdsourcing applications. We consider the setting where we are given a set of items and a set of worker responses for each item: yes/no in the case of filtering and an integer value in the case of rating. We assume that items have a true inherent value that is unknown, and workers draw their responses from a common, but hidden, error distribution. Our goal is to simultaneously assign a ground truth to the item-set and estimate the worker error distribution. Previous work in this area (Raykar and Yu; Whitehill et al.) has focused on heuristics such as Expectation Maximization (EM), providing only a local optima guarantee, while we have developed a general framework that finds a maximum likelihood solution. Our approach extends to a number of variations on the filtering and rating problems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimal Crowd-Powered Rating and Filtering Algorithms

We focus on crowd-powered ltering, i.e., ltering a large set of items using humans. Filtering is one of the most commonly used building blocks in crowdsourcing applications and systems. While solutions for crowd-powered ltering exist, theymake a range of implicit assumptions and restrictions, ultimately rendering them not powerful enough for real-world applications. We describe two approache...

متن کامل

Globally Optimal Crowdsourcing Quality Management

We study crowdsourcing quality management, that is, given worker responses to a set of tasks, our goal is to jointly estimate the true answers for the tasks, as well as the quality of the workers. Prior work on this problem relies primarily on applying ExpectationMaximization (EM) on the underlying maximum likelihood problem to estimate true answers as well as worker quality. Unfortunately, EM ...

متن کامل

Argonaut: Macrotask Crowdsourcing for Complex Data Processing

Crowdsourced workflows are used in research and industry to solve a variety of tasks. The databases community has used crowd workers in query operators/optimization and for tasks such as entity resolution. Such research utilizes microtasks where crowd workers are asked to answer simple yes/no or multiple choice questions with little training. Typically, microtasks are used with voting algorithm...

متن کامل

Optimization techniques for human computation-enabled data processing systems

Crowdsourced labor markets make it possible to recruit large numbers of people to complete small tasks that are difficult to automate on computers. These marketplaces are increasingly widely used, with projections of over $1 billion being transferred between crowd employers and crowd workers by the end of 2012. While crowdsourcing enables forms of computation that artificial intelligence has no...

متن کامل

Tuning the Diversity of Open-Ended Responses From the Crowd

Crowdsourcing can solve problems beyond the reach of state-of-the-art fully automated systems (Bigham et al. 2010; Lasecki et al. 2011; 2012; Bernstein et al. 2011; von Ahn and Dabbish 2004; Attenberg, Ipeirotis, and Provost 2011; Aral, Ipeirotis, and Taylor 2011). A common pattern found in many such systems is for the workers to discover, in parallel, a number of candidate solutions and then v...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014